Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards

نویسندگان

  • Tsung-Heng Tsai
  • Mahlet G. Tadesse
  • Cristina Di Poto
  • Lewis K. Pannell
  • Yehia Mechref
  • Yue Joseph Wang
  • Habtom W. Ressom
چکیده

MOTIVATION Liquid chromatography-mass spectrometry (LC-MS) has been widely used for profiling expression levels of biomolecules in various '-omic' studies including proteomics, metabolomics and glycomics. Appropriate LC-MS data preprocessing steps are needed to detect true differences between biological groups. Retention time (RT) alignment, which is required to ensure that ion intensity measurements among multiple LC-MS runs are comparable, is one of the most important yet challenging preprocessing steps. Current alignment approaches estimate RT variability using either single chromatograms or detected peaks, but do not simultaneously take into account the complementary information embedded in the entire LC-MS data. RESULTS We propose a Bayesian alignment model for LC-MS data analysis. The alignment model provides estimates of the RT variability along with uncertainty measures. The model enables integration of multiple sources of information including internal standards and clustered chromatograms in a mathematically rigorous framework. We apply the model to LC-MS metabolomic, proteomic and glycomic data. The performance of the model is evaluated based on ground-truth data, by measuring correlation of variation, RT difference across runs and peak-matching performance. We demonstrate that Bayesian alignment model improves significantly the RT alignment performance through appropriate integration of relevant information. AVAILABILITY AND IMPLEMENTATION MATLAB code, raw and preprocessed LC-MS data are available at http://omics.georgetown.edu/alignLCMS.html. CONTACT [email protected]. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of miR-24 and miR-137 as novel candidate multiple sclerosis miRNA biomarkers using multi-staged data analysis protocol

Many studies have investigated misregulation of miRNAs relevant to multiple sclerosis (MS) pathogenesis. Abnormal miRNAs can be used both as candidate biomarker for MS diagnosis and understanding the disease miRNA-mRNA regulatory network. In this comprehensive study, misregulated miRNAs related to MS were collected from existing literature, databases and via in silico prediction. A multi-staged...

متن کامل

Analysis of LC-MS Data Using probabilitic-based mixture regression models (Analyse von LC-MS-Daten mit wahrscheinlichkeitsbasierter Mischung von Regressionsmodellen)

A novel framework of a probabilistic-based mixture regression model (PMRM) is presented for alignment of multiple liquid chromatography-mass spectrometry (LC-MS) data with respect to retention time (RT) and mass-to-charge ratio (m/z). The expectation maximization algorithm is used to estimate the joint parameters of spline-based mixture regression models and prior transformation density models....

متن کامل

Analysis of LC-MS Data Using Probabilistic-Based Mixture Regression Models Analyse von LC-MS-Daten mit wahrscheinlichkeitsbasierter Mischung von Regressionsmodellen

A novel framework of a probabilistic-based mixture regression model (PMRM) is presented for alignment of multiple liquid chromatography-mass spectrometry (LC-MS) data with respect to retention time (RT) and mass-to-charge ratio (m/z). The expectation maximization algorithm is used to estimate the joint parameters of spline-based mixture regression models and prior transformation density models....

متن کامل

Blank measurement based time-alignment in LC-MS

Here are presenting the blank based time-alignment (BBTA) as a strong analytical approach for treatment of non-linear shift in time occurring in HPLC-MS data. Need of such tool in recent large dataset produced by analytical chemistry and so-called omics studies is evident. Proposed approach is based on measurement and comparison of blank and analyzed sample evident features. In the first step o...

متن کامل

Method development for simultaneous determination of 41 pesticides in rice using LC-MS/MS technique and its application for the analysis of 60 rice samples collected from Tehran market

For the first time, a multi-residue methodfor simultaneous determination of 41LC-amenable pesticides in rice, belonging to different chemical classes has been developed in Iran by LC-MS/MS. The pesticides were analyzed simultaneously in a single run using positive electrospray ionization with multiple reaction monitoring (MRM) after extraction with slightly modified QuEChERS method. The calibra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 29 21  شماره 

صفحات  -

تاریخ انتشار 2013